Exponentiated Gradient Versus Gradient Descent for Linear Predictors
Authors
Abstract
We consider two algorithms for on-line prediction based on a linear model. The algorithms are the well-known gradient descent (GD) algorithm and a new algorithm, which we call EG. They both maintain a weight vector using simple updates. For the GD algorithm, the update is based on subtracting the gradient of the squared error made on a prediction. The EG algorithm uses the components of the gradient in the exponents of factors that are used in updating the weight vector multiplicatively. We present worst-case loss bounds for EG and compare them to previously known bounds for the GD algorithm. The bounds suggest that the losses of the algorithms are in general incomparable, but EG has a much smaller loss if only a few components of the input are relevant for the predictions. We have performed experiments which show that our worst-case upper bounds are quite tight already on simple artificial data. © 1997 Academic Press
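As a concrete illustration of the two updates described in the abstract, here is a minimal sketch in Python, assuming squared loss (ŷ − y)² and a fixed learning rate η; the function names, starting weights, and the choice of η are illustrative, and EG is shown in its normalized form, where the weights are kept positive and summing to one:

```python
import numpy as np

def gd_update(w, x, y, eta):
    """GD: subtract the gradient of the squared loss (y_hat - y)**2."""
    y_hat = w @ x
    return w - eta * 2.0 * (y_hat - y) * x

def eg_update(w, x, y, eta):
    """EG: multiply each weight by exp(-eta * gradient component),
    then renormalize so the weights stay positive and sum to one."""
    y_hat = w @ x
    w = w * np.exp(-eta * 2.0 * (y_hat - y) * x)
    return w / w.sum()

# Hypothetical online run: only the first input component is relevant,
# the regime in which the abstract says EG incurs much smaller loss.
rng = np.random.default_rng(0)
d = 20
w_gd = np.zeros(d)
w_eg = np.full(d, 1.0 / d)   # EG starts from the uniform weight vector
for t in range(1000):
    x = rng.normal(size=d)
    y = x[0]                 # target depends on one component only
    w_gd = gd_update(w_gd, x, y, eta=0.01)
    w_eg = eg_update(w_eg, x, y, eta=0.01)
```

The multiplicative form is what the sparse-target advantage hinges on: under EG, weights of irrelevant components are driven down by repeated multiplicative factors rather than drifting additively as under GD.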
Similar articles
Exponentiated Gradient versus Gradient Descent for Linear Predictors Produced as Part of the Esprit Working Group in Neural and Computational Learning, Neurocolt 8556
We consider two algorithms for on-line prediction based on a linear model. The algorithms are the well-known gradient descent (GD) algorithm and a new algorithm, which we call EG. They both maintain a weight vector using simple updates. For the GD algorithm, the update is based on subtracting the gradient of the squared error made on a prediction. The EG algorithm uses the components of the grad...
Prior Knowledge and Preferential Structures in Gradient Descent Learning Algorithms
A family of gradient descent algorithms for learning linear functions in an online setting is considered. The family includes the classical LMS algorithm as well as new variants such as the Exponentiated Gradient (EG) algorithm due to Kivinen and Warmuth. The algorithms are based on prior distributions defined on the weight space. Techniques from differential geometry are used to develop the al...
Exponentiated Gradient LINUCB for Contextual Multi-Armed Bandits
We present Exponentiated Gradient LINUCB, an algorithm for contextual multi-armed bandits. This algorithm uses Exponentiated Gradient to find the optimal exploration of the LINUCB. Within a deliberately designed offline simulation framework we conduct evaluations with real online event log data. The experimental results demonstrate that our algorithm outperforms surveyed algorithms.
(Exponentiated) Stochastic Gradient Descent for L1 Constrained Problems
This note is by Sham Kakade, Dean Foster, and Eyal Even-Dar. It is intended as an introductory piece on solving L1 constrained problems with online methods. Convex optimization problems with L1 constraints frequently underly solving such tasks as feature selection problems and obtaining sparse representations. This note shows that the exponentiated gradient algorithm (of Kivinen and Warmuth (19...
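The connection this note exploits can be sketched via the EG± construction of Kivinen and Warmuth, in which two positive weight vectors share one normalization so that the effective weights w = U·(p⁺ − p⁻) remain inside the L1 ball of radius U. The step below is a sketch under that assumption, with illustrative names, not the note's exact algorithm:

```python
import numpy as np

def eg_pm_step(p_pos, p_neg, grad, eta, U):
    """One EG± step: exponentiate the signed gradient into multiplicative
    factors and renormalize both copies jointly, which keeps
    w = U * (p_pos - p_neg) inside the L1 ball of radius U."""
    p_pos = p_pos * np.exp(-eta * U * grad)
    p_neg = p_neg * np.exp(+eta * U * grad)
    z = p_pos.sum() + p_neg.sum()
    return p_pos / z, p_neg / z
```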
An Algorithm for Online Tensor Prediction: DRAFT Do Not Distribute
We present a new method for online prediction and learning of tensors (N-way arrays, N > 2) from sequential measurements. We focus on the specific case of 3-D tensors and exploit a recently developed framework of structured tensor decompositions proposed in [1]. In this framework it is possible to treat 3-D tensors as linear operators and appropriately generalize notions of rank and positive de...
Journal: Inf. Comput.
Volume: 132, Issue: -
Pages: -
Publication date: 1997